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TITLE OF THE INVENTION 
Methods, Systems, and Products for Verifying Integrity of Web-Server Content 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application relates to the concurrently filed and commonly-assigned United States 
Application No. XX/XXX,XXX (amend with serial number) (Attorney Docket 030820), of 
which the "Brief Summary of the Invention" section and the "Detailed Description of the 
Invention" sections are incorporated herein by reference. 

[0002] This application also relates to the concurrently filed and commonly-assigned United 
States Application No. XX/XXX,XXX (amend with serial number) (Attorney Docket 040056), 
of which the "Brief Summary of the Invention" section and the "Detailed Description of the 
Invention" sections are incorporated herein by reference. 

NOTICE OF COPYRIGHT PROTECTION 

[0003] A portion of the disclosure of this patent document and its figures contain material 
subject to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent disclosure, but otherwise reserves 
all copyrights whatsoever. 

BACKGROUND OF THE INVENTION 
1 . Field of the Invention 

[0004] This invention generally relates to electrical computers and, more particularly, to data 
processing system error/fault handling and to multicomputer data transferring. 
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Description of the Related Art 



[0005] Broken web links are a common problem in the Internet industry. Internet web pages 
are electronic documents that are retrieved from a web server. A Uniform Resource Locator 
(URL) identifies the server that stores/hosts the web page. These web pages, however, may 
themselves contain embedded links to additional content. When a web page is retrieved from the 
web server, all the component parts linked to that web page should also be retrievable. Often 
times, however, one or more links to additional content are broken. That is, when the web page 
is retrieved, one or more of the embedded links refer to non-existent web pages or web pages 
with errors. Sometimes the embedded URL address is incorrectly typed. Sometimes the servers 
storing the embedded content is malfunctioning or not available. Sometimes the servers serve 
partial pages due to components of the server code malfunctioning. Most times, however, the 
document located by the embedded URL is no longer stored on the web server. A failed 
embedded link often results in the familiar Error 404 "Page Not Found." When an embedded 
links fails, components are missing and the integrity of the web page is lost. The user requesting 
the web page is frustrated, and the web master responsible for the web page is frustrated. There 
is, accordingly, a need in the art for methods, systems, and products for verifying the integrity of 
content served by web servers. 



BRIEF SUMMARY OF THE INVENTION 



[0006] The aforementioned problems, and other problems, are reduced by this invention. This 
invention comprises methods, computer systems, computer programs, and computer program 
products that verify the integrity of web content. This invention analyzes integrity using a client- 
resident integrity checking tool and a server-resident integrity checking tool. This invention then 
combines the results of each integrity checking tool to systematically and to thoroughly discover 
broken links. Due to the volume of content being served by web servers, usually there is more 
than one server serving the same content in what are called "web farms." Certain intermittent 
errors, caused by a failure of a server in the web farm, are difficult to find with server-only or 
client-only methods. By combining the results from both client and server side tools this 
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invention provides a way to precisely identify these intermittent errors and enhances the quality 
of service. These broken links destroy the integrity of a web resource provided by a web server. 
This invention thus enables the user (such as an administrator and/or a web master) to present 
documents and content that are complete and without missing parts. If a web site has broken 
links to non-existing content, the customer has a poor experience and can become frustrated with 
the web site. These broken links may even cause some customers to choose another web site for 
goods and services. This invention, then, helps maintain a high quality web site that pleases and 
retains customers. 

[0007] This invention discloses methods, systems, and products for verifying the integrity of 
web server content. One of the embodiments describes a combined client-resident and server- 
resident content-integrity tool that helps identify integrity problems. One method receives results 
from a client-resident integrity program and from a server-resident integrity program. The client- 
resident integrity program operates on a client computer, while the server-resident integrity 
program operates on a web server. This invention merges the results from the client-resident 
integrity program and from the server-resident integrity program. The merged results are then 
presented at a computer system.. 

[0008] Other embodiments of this invention describe a system for verifying the integrity of web 
server content. The system comprises an Integrity Module stored in a memory device, and a 
processor communicating with the memory device. The Integrity Module receives results from a 
client-resident integrity program operating on a client computer. The client-resident integrity 
program verifies integrity of a web resource communicated from a web server to the client 
computer. The Integrity Module also receives results from a server-resident integrity program 
operating on the web server. The Integrity Module merges the results of the client-resident 
integrity program and the server-resident integrity program. The Integrity Module also visually 
and/or audibly presents the merged results at a computer system. 

[0009] Other embodiments of this invention describe a computer program product. A 
computer-readable medium stores an Integrity Module. The Integrity Module receives results 
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from a client-resident integrity program and from a server-resident integrity program. The client- 
resident integrity program operates on a client computer and verifies the integrity of a web 
resource communicated from a web server to the client computer. The Integrity Module merges 
the results of the client-resident integrity program and the server-resident integrity program and 
presents the merged results at a computer system. 

[0010] Other systems, methods, and/or computer program products according to embodiments 
will be or become apparent to one with skill in the art upon review of the following drawings and 
detailed description. It is intended that all such additional systems, methods, and/or computer 
program products be included within this description, be within the scope of the present 
invention, and be protected by the accompanying claims. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0011] These and other features, aspects, and advantages of the embodiments of the present 
invention are better understood when the following Detailed Description of the Invention is read 
with reference to the accompanying drawings, wherein: 

FIG. 1 is a schematic illustrating the embodiments of this invention; and 
FIGS. 2-4 are flowcharts illustrating a method of verifying the integrity of web server 
content, according to the embodiments of this invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0012] This invention now will be described more fully hereinafter with reference to the 
accompanying drawings, in which exemplary embodiments are shown. This invention may, 
however, be embodied in many different forms and should not be construed as limited to the 
embodiments set forth herein. These embodiments are provided so that this disclosure will be 
thorough and complete and will fully convey the scope of the invention to those of ordinary skill 
in the art. Moreover, all statements herein reciting embodiments of the invention, as well as 
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specific examples thereof, are intended to encompass both structural and functional equivalents 
thereof Additionally, it is intended that such equivalents include both currently known 
equivalents as well as equivalents developed in the future (i.e., any elements developed that 
perform the same function, regardless of structure). 

[0013] Thus, for example, it will be appreciated by those of ordinary skill in the art that the 
diagrams, schematics, illustrations, and the like represent conceptual views or processes 
illustrating systems and methods embodying this invention. The functions of the various 
elements shown in the figures may be provided through the use of dedicated hardware as well as 
hardware capable of executing associated software. Similarly, any switches shown in the figures 
are conceptual only. Their function may be carried out through the operation of program logic, 
through dedicated logic, through the interaction of program control and dedicated logic, or even 
manually, the particular technique being selectable by the entity implementing this invention. 
Those of ordinary skill in the art further understand that the exemplary hardware, software, 
processes, methods, and/or operating systems described herein are for illustrative purposes and, 
thus, are not intended to be limited to any particular named manufacturer. 

[0014] This invention verifies the integrity of web content. This invention analyzes integrity 
using a client-resident integrity checking tool and a server-resident integrity checking tool. This 
invention then combines the results of each integrity checking tool to systematically and to 
thoroughly discover broken links. These broken links destroy the integrity of a web resource 
provided by a web server. This invention thus allows a user (such as an administrator and/or a 
web master) to present documents and content that are complete and without missing parts. If a 
web site has broken links to non-existing content, the customer has a poor experience and can 
become frustrated with the web site. These broken links may even cause some customers to 
choose another web site for goods and services. This invention, then, helps maintain a high 
quality web site that pleases and retains customers. 

[0015] This invention provides a more thorough integrity analysis. This invention is the first 
software product that allows a client-resident integrity checking tool and a server-resident 
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integrity checking tool to cooperate. That is, this invention merges the output from each separate 
operation to provide a complete integrity analysis. A user of this invention may thus view any 
errors, conditions, and/or events detected by either the client-resident integrity checking tool or 
the server-resident integrity checking tool. This invention may be configured to run at any time 
or even multiple times per day. When any type of integrity error is detected, the invention may 
even be configured to prominently present that integrity error. This invention, then, detects any 
number of errors that may identify integrity errors. The user of this invention may then quickly 
repair these errors to maintain the integrity of the web site. 

[0016] FIG. 1 is a schematic illustrating this invention. The embodiments of this invention 
include an Integrity Module 20. The Integrity Module 20 comprises methods, systems, computer 
programs, and/or computer program products that verify static and dynamically-generated web 
server content. The Integrity Module 20 receives client-side results 22 from a client-resident 
integrity program 24. The Integrity Module 20 also receives server-side results 26 from a server- 
resident integrity program 28. The client-resident integrity program 24 verifies the integrity of a 
web resource 30 communicated from one or more web servers 32 and 33 to a client computer 34. 
The server-resident integrity program 28 helps identify server-side errors that cause broken web 
links. The Integrity Module 20 merges these results, sorts these results, and then visually/audibly 
presents these results to a user at a user computer 36. As the following paragraphs explain, the 
Integrity Module 20 reports integrity errors in the web resource 30. The user at the user 
computer 36 may then use the Integrity Module 20 to identify and to resolve these integrity 
errors. 

[0017] The Integrity Module 20 communicates via a distributed computing network 38. The 
distributed computing network 38 may include the Internet (sometimes alternatively known as 
the "World Wide Web"), an intranet, a local-area network (LAN), and/or a wide-area network 
(WAN). The Integrity Module 20 operates within any computer system, such as the user 
computer 36. The client-resident integrity program 24 operates within any computer system, 
such as the client computer 34. The server-resident integrity program 28 also operates within any 
computer system, such as the web servers 32 and 33. Due to the volume of content being served 
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by web servers, the web servers 32 and 33 illustrate the server-resident integrity program 28 
operating in multiple servers in a web farm. The Integrity Module 20 receives the client-side 
results 22 via the distributed computing network 38. The Integrity Module 20 also receives the 
server-side results 26 via the distributed computing network 28. Once the client-side results 22 
and the server-side results 26 are received, the Integrity Module 20 arranges and presents the 
results for user at the user computer 36. 

[0018] The client-resident integrity program 24 and the server-resident integrity program 26 are 
not described in detail. If the reader desires a more detailed explanation of the client-resident 
integrity program 24, the reader is directed to the concurrently filed and commonly-assigned 
United States Application No. XX/XXX,XXX (amend with serial number) (Attorney Docket 
030820), of which the "Brief Summary of the Invention" section and the "Detailed Description 
of the Invention" sections are incorporated herein by reference. The server-resident integrity 
program 26, similarly, is not further described. If the reader desires a more detailed explanation 
of the server-resident integrity program 26, the reader is directed to the concurrently filed and 
commonly-assigned United States Application No. XX/XXX,XXX (amend with serial number) 
(Attorney Docket 040056), of which the "Brief Summary of the Invention" section and the 
"Detailed Description of the Invention" sections are incorporated herein by reference. 

[0019] The Integrity Module 20 arranges and presents the results. The Integrity Module 20 
receives the client-side results 22 via the distributed computing network 38. The Integrity 
Module 20 also receives the server-side results 26 via the distributed computing network 28. 
Once the client-side results 22 and the server-side results 26 are received, the Integrity Module 
20 merges, sorts, and presents the results at the user computer 36. The Integrity Module 20 may 
invoke various logical rules 40 when sorting and when presenting the merged results. These 
logical rules 40 are stored within the user computer 36 in a system memory device (such as a 
memory subsystem, flash memory, or a peripheral storage device). The logical rules 40, 
however, may alternatively or additionally be remotely stored and accessed via the distributed 
computing network 28. The terms "reports" and "presents" means the Integrity Module 20 
displays the merged results on a display device {e.g., computer display, personal data assistant, 
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cell phone, or other communications device). The terms "reports" and "presents" may also mean 
the Integrity Module 20 prints a hardcopy of the merged results. The terms "reports" and 
"presents," however, may mean the Integrity Module 20 stores the merged results for retrieval. 

[0020] The Integrity Module 20 uses the logical rules 40 to sort the merged results. The logical 
rules 40 can be written to report any condition, occurrence, and/or event that may indicate an 
integrity error. If, for example, the client-side results 22 and/or the server-side results 26 indicate 
an error message was found, this error message may indicate the existence of a partial page error. 
The logical rules 40, then, might instruct the Integrity Module 20 to prominently present and/or 
report the results associated with this error message. The logical rules 40 might also instruct the 
Integrity Module 20 to sort the merged results according to the error message. The logical rules 
40 may instruct the Integrity Module 20 to sort the merged results according to a referring 
Uniform Resource Locator and/or a requested Uniform Resource Locator. If the file size of the 
web resource 30 is smaller than expected, the logical rules 40 may instruct the Integrity Module 
20 to present/report an error. If the client-side results 22 and/or the server-side results 26 indicate 
a failed communication link with the web server 32, the logical rules 40 might instruct the 
Integrity Module 20 to prominently present the communication error. If either the client-side 
results 22 or the server-side results 26 indicate an error, the logical rules 40 might instruct the 
Integrity Module 20 to sort and to present the results according to a status code associated with 
the web resource 30. However the logical rules 40 are specified, the Integrity Module 20 can 
produce/present/report complete details of any condition, occurrence, and/or event that may 
indicate an integrity error. 

[0021] FIG. 2 is a flowchart illustrating a method of verifying the integrity of web server 
content. Results from a client-resident integrity program are received (Block 42). The results 
from the client-resident integrity program may include an error message indicating existence of a 
partial page error. Results from a server-resident integrity program are also received (Block 44). 
The results of the client-resident integrity program and the server-resident integrity program are 
merged (Block 46). If the error message indicates the existence of a partial page error (Block 
48), then the merged results are sorted according to the error message (Block 50). If the size of a 
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web resource is smaller than expected (Block 52), then the merged results are sorted according to 
the web resource and/or the file size (Block 54). 

[0022] The flowchart continues with FIGS. 3 and 4. If a communication link to a web server 
fails (Block 56), then the merged results are sorted according to the failed communication link 
(Block 58). If a status code indicates an error (Block 60), then the merged results are sorted 
according to the status code (Block 62). If a requested Uniform Resource Locator ("URL") is 
erroneous (Block 64), then the merged results are sorted according to a referring Uniform 
Resource Locator ("URL") (Block 66). If the web resource contains no data (Block 68 of FIG. 
4), then the requested Uniform Resource Locator ("URL") is erroneous (Block 70) and the 
merged results are sorted according to a referring Uniform Resource Locator ("URL") (Block 
72). The merged data may then be presented/reported according to the requested Uniform 
Resource Locator ("URL") (Block 74), the referring Uniform Resource Locator ("URL") (Block 
76), the error message (Block 78), the file size of the web resource (Block 80), and/or the status 
code (Block 82). Once the integrity error is detected, the user may repair the error and then 
discard the integrity error from future reports (Block 84). The user may thus eliminate repeatedly 
reporting the same integrity error even though repaired. 

[0023] The Integrity Module may be physically embodied on or in a computer-readable 
medium. This computer-readable medium may include CD-ROM, DVD, tape, cassette, floppy 
disk, memory card, and large-capacity disk (such as IOMEGA®, ZIP®, JAZZ®, and other large- 
capacity memory products (IOMEGA®, ZIP®, and JAZZ® are registered trademarks of Iomega 
Corporation, 1821 W. Iomega Way, Roy, Utah 84067, 801.332.1000, www.iomega.com). This 
computer-readable medium, or media, could be distributed to end-users, licensees, and assignees. 
These types of computer-readable media, and other types not mention here but considered within 
the scope of the present invention, allow the Integrity Module to be easily disseminated. A 
computer program product for verifying the integrity of web server content includes the Integrity 
Module stored on the computer-readable medium. The Integrity Module receives results from a 
client-resident integrity program operating on a client computer. The client-resident integrity 
program verifies the integrity of a web resource communicated from a web server to the client 
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computer. The Integrity Module also receives results from a server-resident integrity program 
operating on the web server. The Integrity Module merges the results of the client-resident 
integrity program and the server-resident integrity program and presents the merged results at a 
computer system. 

[0024] The Integrity Module may also be physically embodied on or in any addressable {e.g., 
HTTP, I.E.E.E. 802.11, Wireless Application Protocol (WAP)) wireline or wireless device 
capable of presenting an IP address. Examples could include a computer, a wireless personal 
digital assistant (PDA), an Internet Protocol mobile phone, or a wireless pager. 

[0025] While the present invention has been described with respect to various features, aspects, 
and embodiments, those skilled and unskilled in the art will recognize the invention is not so 
limited. Other variations, modifications, and alternative embodiments may be made without 
departing from the spirit and scope of the present invention. 
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